GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:          January 9, 2004, 21:11:01 ; Search time 9362 Seconds
                 (without alignments)
                 11483.711 Million cell updates/sec

Title:           US-09-919-831-1
Perfect score:   2628
Sequence:        1 cggtcacgttgggatcgttg ggcgacgagggtggagctca 2628

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        2888711 seqs, 20454813386 residues

Total number of hits satisfying chosen parameters: 5777422

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
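
The throughput figure in this run header can be cross-checked from the other header fields. The sketch below is a minimal Python check, not part of GenCore. It assumes (an inference from the reported numbers, not something the report states) that every query residue is compared with every database residue on both strands, so the cell count is 2 x query length x database residues. The function name `million_cell_updates_per_sec` is hypothetical.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float, strands: int = 2) -> float:
    """Estimate alignment-matrix cell updates per second, in millions.

    Assumption: cells = strands * query_len * db_residues, i.e. the whole
    database is scanned once per strand.
    """
    cells = strands * query_len * db_residues
    return cells / seconds / 1e6


# Header values of the GenEmbl run: query length 2628,
# 20454813386 database residues, 9362 seconds of search time.
rate = million_cell_updates_per_sec(2628, 20454813386, 9362)
print(f"{rate:.3f} Million cell updates/sec")
```

Under that assumption the result should agree closely with the reported 11483.711, which suggests the search time is the only rounded input.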



Database:        GenEmbl:*
                  1: gb_ba:*
                  2: gb_htg:*
                  3: gb_in:*
                  4: gb_om:*
                  5: gb_ov:*
                  6: gb_pat:*
                  7: gb_ph:*
                  8: gb_pl:*
                  9: gb_pr:*
                 10: gb_ro:*
                 11: gb_sts:*
                 12: gb_sy:*
                 13: gb_un:*
                 14: gb_vi:*
                 15: em_ba:*
                 16: em_fun:*
                 17: em_hum:*
                 18: em_in:*
                 19: em_mu:*
                 20: em_om:*
                 21: em_or:*
                 22: em_ov:*
                 23: em_pat:*
                 24: em_ph:*
                 25: em_pl:*
                 26: em_ro:*
                 27: em_sts:*
                 28: em_un:*
                 29: em_vi:*
                 30: em_htg_hum:*
                 31: em_htg_inv:*
                 32: em_htg_other:*
                 33: em_htg_mus:*
                 34: em_htg_pln:*
                 35: em_htg_rod:*
                 36: em_htg_mam:*
                 37: em_htg_vrt:*
                 38: em_sy:*
                 39: em_htgo_hum:*
                 40: em_htgo_mus:*
                 41: em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result          Query
  No.    Score  Match    Length  DB  ID         Description

    1     2628  100.0      2628      AX394263   AX394263 Sequence
c   2     2628  100.0    320550      AP005282   AP005282 Corynebac
c   3     2628  100.0    349980      AX127152   AX127152 Sequence
    4   1168.4   44.5      1170      AX063913   AX063913 Sequence
    5   1168.4   44.5      1170      AX244087   AX244087 Sequence
    6     1146   43.6      1146      AX123145   AX123145 Sequence
    7     1146   43.6      1146      BD165262   BD165262 Novel pol
    8      567   21.6       567      AX123146   AX123146 Sequence
    9      567   21.6       567      BD165263   BD165263 Novel pol
   10      485   18.5       762      AX123144   AX123144 Sequence
   11      485   18.5       762      BD165261   BD165261 Novel pol
c  12      443   16.9      1041      AX123143   AX123143 Sequence
c  13      443   16.9      1041      BD165260   BD165260 Novel pol
   14    440.6   16.8       982      AX394270   AX394270 Sequence
   15      370   14.1       402      AX394269   AX394269 Sequence
c  16    353.4   13.4       834      AX065835   AX065835 Sequence
c  17    295.4   11.2    302070   1  AP005223   AP005223 Corynebac
c  18      240    9.1       240   6  AX123147   AX123147 Sequence
c  19      240    9.1       240   6  BD165264   BD165264 Novel pol
   20    138.2    5.3    346294   1  AP002999   AP002999 Mesorhizo
c  21    132.6    5.0     11425   1  AE001919   AE001919 Deinococc
c  22    129.6    4.9     11093   1  AE008272   AE008272 Agrobacte
   23    129.6    4.9     13715   1  AE009343   AE009343 Agrobacte
c  24    127.6    4.9    299750   1  AP005964   AP005964 Bradyrhiz
c  25    116.4    4.4      7186   1  AE009445   AE009445 Brucella
   26    113.2    4.3     11699   1  AE014493   AE014493 Brucella
   27    110.8    4.2    216750   1  BSUB0007   Z99110 Bacillus su
   28    110.2    4.2      1218   9  BT006882   BT006882 Homo sapi
   29    110.2    4.2      1218  12  BT007589   BT007589 Synthetic
   30    110.2    4.2      1326   6  AR144980   AR144980 Sequence
   31    110.2    4.2      1326   6  AX050468   AX050468 Sequence
   32    110.2    4.2      1326   9  S52784     S52784 cystathioni
   33    110.2    4.2      1831   9  BC015807   BC015807 Homo sapi
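
The Query Match column in the summaries appears to be each hit's score expressed as a percentage of the perfect score given in the run header. The Python sketch below illustrates that reading. The formula is inferred from the listed values rather than stated by the report, and `query_match_percent` is a hypothetical helper name.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect (self-match) score, to one decimal."""
    return round(100.0 * score / perfect_score, 1)


# Scores taken from the nucleic-nucleic summaries; perfect score is 2628.
for score in (2628, 1168.4, 567, 110.2):
    print(f"{score:>7}  {query_match_percent(score, 2628):5.1f}")
```

For these four scores the helper gives 100.0, 44.5, 21.6 and 4.2, matching the Query Match values listed for the same scores.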



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:          January 9, 2004, 19:54:00 ; Search time 672 Seconds
                 (without alignments)
                 10556.731 Million cell updates/sec

Title:           US-09-919-831-1
Perfect score:   2628
Sequence:        1 cggtcacgttgggatcgttg ggcgacgagggtggagctca 2628

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        2552756 seqs, 1349719017 residues

Total number of hits satisfying chosen parameters: 5105512

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
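
In every run of this report the "Total number of hits satisfying chosen parameters" is exactly twice the number of sequences searched. One plausible reading, offered here as an assumption rather than documented GenCore behaviour, is that each database sequence is scored once per strand and that a 0% minimum match filters nothing out. The Python sketch below only checks the ratio. The dictionary name `RUNS` is hypothetical, and its values are copied from the run headers.

```python
# (sequences searched, total hits) as printed in the run headers.
RUNS = {
    "GenEmbl": (2888711, 5777422),
    "N_Geneseq_19Jun03": (2552756, 5105512),
    "Issued_Patents_NA": (569978, 1139956),
    "Published_Applications_NA": (2276164, 4552328),
    "EST": (22781392, 45562784),
}


def hits_per_sequence(seqs: int, hits: int) -> float:
    """Ratio of reported hits to sequences searched."""
    return hits / seqs


for name, (seqs, hits) in RUNS.items():
    print(f"{name:<28} {hits_per_sequence(seqs, hits):.1f}")
```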



Database:        N_Geneseq_19Jun03:*
                  1: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
                  2: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
                  3: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
                  4: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
                  5: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
                  6: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
                  7: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
                  8: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
                  9: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
                 10: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
                 11: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
                 12: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
                 13: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
                 14: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
                 15: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
                 16: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
                 17: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
                 18: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
                 19: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
                 20: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
                 21: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
                 22: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
                 23: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
                 24: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*
                 25: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2003.DAT:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



SUMMARIES 



Result          Query
  No.    Score  Match    Length  DB  ID         Description

    1     2628  100.0      2628  24  AAD34148   Corynebacterium gl
c   2     2628  100.0    349980  22  AAH68533   C glutamicum codin
    3   1168.4   44.5      1170  22  AAF71850   Corynebacterium gl
    4   1168.4   44.5      1170  23  AAS96112   C. glutamicum gene
    5     1146   43.6      1146  22  AAH68026   C glutamicum codin
    6      567   21.6       567  22  AAH68027   C glutamicum codin
    7      567   21.6       567  25  ACA00349   C. glutamicum deri
    8      485   18.5       762  22  AAH68025   C glutamicum codin
c   9      443   16.9      1041  22  AAH68024   C glutamicum codin
   10    440.6   16.8       982  24  AAD34153   Corynebacterium gl
   11      370   14.1       402  24  AAD34152   Corynebacterium gl
c  12    353.4   13.4       834  22  AAF71227   Corynebacterium gl
c  13      240    9.1       240  22  AAH68028   C glutamicum codin
c  14      240    9.1       240  25  ACA00365   C. glutamicum deri
c  15      222    8.4       222  25  ACA00358   C. glutamicum deri
   16    110.2    4.2      1326  22  AAC91230   Human gamma-cystat
   17    110.2    4.2      1924  23  ABV25287   Human prostate exp
   18     99.4    3.8      1779  22  AAS03014   Human diagnostic a
   19     97.6    3.7      1733  20  AAV99909   Fragment of cystat
   20     94.6    3.6      1185  23  AAS52784   Enterococcus faeca
c  21     93.6    3.6    349980  24  ABQ81847   Bifidobacterium lo
   22     93.6    3.6   4403765  22  AAI99683   Mycobacterium tube
   23     93.6    3.6   4411529  22  AAI99682   Mycobacterium tube
c  24     87.4    3.3   2944528  24  ABA03041   Listeria monocytog
c  25     86.6    3.3     31702  24  ABQ67190   Listeria innocua c
c  26     86.6    3.3   3011208  24  ABQ69245   Listeria innocua D
   27     86.2    3.3      1302  23  AAS56046   Salmonella typhi D
   28     86.2    3.3      1395  22  AAH43576   Cystathionine beta
   29     86.2    3.3      1395  22  AAA89292   Arabidopsis cystat
   30     86.2    3.3      1644  21  AAC37291   Arabidopsis thalia
   31       86    3.3     78845  21  AAA81463   N. meningitidis pa
c  32       86    3.3    349980  21  AAF21608   Neisseria meningit
c  33       86    3.3   1437668  21  AAA81490   N. meningitidis B
   34     85.8    3.3      2411  24  ABQ70794   Listeria monocytog
   35     85.6    3.3      1209  21  AAC38726   Arabidopsis thalia
   36     84.8    3.2      1889  21  AAC36342   Arabidopsis thalia
   37     84.2    3.2      1161  23  AAS52693   E. coli DNA for ce
   38       84    3.2      1176  25  ABA00778   N-terminal domain-
   39       84    3.2      1398  25  ABA00777   90 bp domain delet
   40       84    3.2      1488  25  ABA00780   Transit peptide de
   41       84    3.2      1692  24  ABZ14817   Arabidopsis thalia
   42       84    3.2      1692  25  ABA00779   Full length Arabid
   43       83    3.2   2365589  24  ABA90521   Genomic sequence o
   44     82.4    3.1      1129  21  AAC34676   Arabidopsis thalia
   45       82    3.1      1092  25  ABX07272   S. pneumoniae type



ALIGNMENTS 
OM nucleic - nucleic search, using sw model

Run on:          January 9, 2004, 21:54:45 ; Search time 180 Seconds
                 (without alignments)
                 6444.194 Million cell updates/sec

Title:           US-09-919-831-1
Perfect score:   2628
Sequence:        1 cggtcacgttgggatcgttg ggcgacgagggtggagctca 2628

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        569978 seqs, 220691566 residues

Total number of hits satisfying chosen parameters: 1139956

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database:        Issued_Patents_NA:*
                  1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
                  2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
                  3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
                  4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
                  5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
                  6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

SUMMARIES 



Result          Query
  No.    Score  Match    Length  DB  ID                   Description

    1    110.2    4.2      1326   3  US-09-318-448-28     Sequence 28, Appl
    2     97.2    3.7     36800   4  US-08-311-731A-139   Sequence 139, App
    3     93.6    3.6   4403765   3  US-09-103-840A-2     Sequence 2, Appli
    4     93.6    3.6   4411529   3  US-09-103-840A-1     Sequence 1, Appli
    5     86.2    3.3      1395   3  US-09-586-719-3      Sequence 3, Appli
c   6       82    3.1     11864   4  US-08-961-527-61     Sequence 61, Appl
c   7     80.2    3.1   1830121   4  US-09-557-884-1      Sequence 1, Appli
c   8     80.2    3.1   1830121   4  US-09-643-990A-1     Sequence 1, Appli
    9     77.8    3.0      1615   1  US-08-176-413-4      Sequence 4, Appli
   10     77.8    3.0      1615   4  US-09-347-878-13     Sequence 13, Appl
   11     77.8    3.0      1615   5  PCT-US94-14919-4     Sequence 4, Appli
   12     73.8    2.8      1392   4  US-09-252-991A-408   Sequence 408, App
OM nucleic - nucleic search, using sw model

Run on:          January 9, 2004, 22:06:10 ; Search time 916 Seconds
                 (without alignments)
                 9962.912 Million cell updates/sec

Title:           US-09-919-831-1
Perfect score:   2628
Sequence:        1 cggtcacgttgggatcgttg ggcgacgagggtggagctca 2628

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        2276164 seqs, 1736306516 residues

Total number of hits satisfying chosen parameters: 4552328

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database:        Published_Applications_NA:*
                  1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                  2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                  3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                  4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                  5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                  6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                  7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                  8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                  9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
                 10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
                 11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
                 12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
                 13: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq2:*
                 14: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
                 15: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
                 16: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
                 17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
                 18: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                  %
Result          Query
  No.    Score  Match    Length  DB  ID                   Description

    1     2628  100.0      2628  10  US-09-919-831-1      Sequence 1, Appli
    2     2628  100.0   3309400  10  US-09-738-626-1      Sequence 1, Appli
    3   1168.4   44.5      1170  11  US-09-746-660A-79    Sequence 79, Appl
    4     1146   43.6      1146  10  US-09-738-626-3061   Sequence 3061, Ap
    5      567   21.6       567  10  US-09-738-626-3062   Sequence 3062, Ap
    6      485   18.5       762  10  US-09-738-626-3060   Sequence 3060, Ap
    7      443   16.9      1041  10  US-09-738-626-3059   Sequence 3059, Ap
    8    440.6   16.8       982  10  US-09-919-831-8      Sequence 8, Appli
    9      370   14.1       402  10  US-09-919-831-7      Sequence 7, Appli
   10      240    9.1       240  10  US-09-738-626-3063   Sequence 3063, Ap
   11    110.8    4.2      1173  12  US-10-369-493-41386  Sequence 41386, A
   12    100.8    3.8      1173  12  US-10-369-493-40920  Sequence 40920, A
   13    100.6    3.8      1581  10  US-09-931-457A-67    Sequence 67, Appl
   14     99.6    3.8      1140  12  US-10-369-493-24163  Sequence 24163, A
   15     97.6    3.7      1733  10  US-09-931-457A-37    Sequence 37, Appl
   16       97    3.7      1699  10  US-09-931-457A-71    Sequence 71, Appl
   17     95.4    3.6      1125  12  US-10-369-493-32005  Sequence 32005, A
   18     94.6    3.6      1185   9  US-09-815-242-6421   Sequence 6421, Ap
   19     94.2    3.6      1685  10  US-09-931-457A-69    Sequence 69, Appl
   20     93.6    3.6      1089  12  US-10-369-493-31668  Sequence 31668, A
   21       90    3.4      1053  12  US-10-369-493-37724  Sequence 37724, A
   22     89.8    3.4       789  10  US-09-931-457A-33    Sequence 33, Appl
   23       89    3.4      1128  12  US-10-369-493-44368  Sequence 44368, A
   24     88.2    3.4      1152  15  US-10-156-761-3292   Sequence 3292, Ap
   25     88.2    3.4   9025608  15  US-10-156-761-1      Sequence 1, Appli
   26     87.4    3.3      1140  12  US-10-369-493-40221  Sequence 40221, A
   27     87.2    3.3      1143  12  US-10-369-493-32030  Sequence 32030, A
   28     86.4    3.3      1191  12  US-10-369-493-27630  Sequence 27630, A
   29     86.2    3.3      1302   9  US-09-815-242-9683   Sequence 9683, Ap
   30     84.4    3.2      1119  12  US-10-369-493-40265  Sequence 40265, A
   31     84.2    3.2      1161   9  US-09-815-242-6330   Sequence 6330, Ap
   32     84.2    3.2      1161  12  US-10-369-493-47328  Sequence 47328, A
   33       84    3.2      1692  10  US-09-938-842A-2622  Sequence 2622, Ap
   34     83.6    3.2      1071  12  US-10-369-493-42984  Sequence 42984, A
   35       83    3.2      1143  12  US-10-369-493-42062  Sequence 42062, A
   36     82.8    3.2      1218  12  US-10-369-493-41227  Sequence 41227, A
   37     82.6    3.1      1116  12  US-10-369-493-35572  Sequence 35572, A
   38       82    3.1      1095   9  US-09-815-242-9554   Sequence 9554, Ap
   39     81.6    3.1      1074  12  US-10-369-493-42985  Sequence 42985, A
   40     80.8    3.1      1158  10  US-09-738-626-3491   Sequence 3491, Ap
   41     80.2    3.1      1146   9  US-09-815-242-6865   Sequence 6865, Ap
   42     80.2    3.1   1830121  15  US-10-329-960-1      Sequence 1, Appli
   43     79.6    3.0      1122  12  US-10-369-493-32861  Sequence 32861, A
   44     79.4    3.0      1056  12  US-10-369-493-44385  Sequence 44385, A
   45     78.8    3.0      1089  12  US-10-369-493-42189  Sequence 42189, A



ALIGNMENTS 



RESULT 1 
US-09-919-831-1 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:          January 9, 2004, 21:45:20 ; Search time 4931 Seconds
                 (without alignments)
                 12953.187 Million cell updates/sec

Title:           US-09-919-831-1
Perfect score:   2628
Sequence:        1 cggtcacgttgggatcgttg ggcgacgagggtggagctca 2628

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        22781392 seqs, 12152238056 residues

Total number of hits satisfying chosen parameters: 45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database:        EST:*
                  1: em_estba:*
                  2: em_esthum:*
                  3: em_estin:*
                  4: em_estmu:*
                  5: em_estov:*
                  6: em_estpl:*
                  7: em_estro:*
                  8: em_htc:*
                  9: gb_est1:*
                 10: gb_est2:*
                 11: gb_htc:*
                 12: gb_est3:*
                 13: gb_est4:*
                 14: gb_est5:*
                 15: em_estfun:*
                 16: em_estom:*
                 17: em_gss_hum:*
                 18: em_gss_inv:*
                 19: em_gss_pln:*
                 20: em_gss_vrt:*
                 21: em_gss_fun:*
                 22: em_gss_mam:*
                 23: em_gss_mus:*
                 24: em_gss_pro:*
                 25: em_gss_rod:*
                 26: em_gss_phg:*
                 27: em_gss_vrl:*
                 28: gb_gss1:*
                 29: gb_gss2:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                  %
Result          Query
  No.    Score  Match    Length  DB  ID         Description

    1    102.4    3.9       675  14  CA757379   CA757379 BR0500100
    2    100.6    3.8      2122  11  AY109661   AY109661 Zea mays
    3     99.8    3.8       563  10  BE493057   BE493057 WHE0562 H
c   4     99.6    3.8       755  12  BJ373401   BJ373401 BJ373401
    5     99.2    3.8       680  13  BW055892   BW055892 BW055892
    6     98.8    3.8       541  13  BW215645   BW215645 BW215645
    7     98.6    3.8      1137  14  CA616422   CA616422 wrl.pk009
c   8     98.2    3.7       742  12  BJ436133   BJ436133 BJ436133
c   9       98    3.7       672  12  BJ374847   BJ374847 BJ374847
c  10       98    3.7       673  12  BJ429423   BJ429423 BJ429423
c  11       98    3.7       673  12  BJ432935   BJ432935 BJ432935
c  12       98    3.7       691  12  BJ400379   BJ400379 BJ400379
c  13       98    3.7       692  12  BJ400202   BJ400202 BJ400202
c  14       98    3.7       693  12  BJ376289   BJ376289 BJ376289
c  15       98    3.7       700  12  BJ428764   BJ428764 BJ428764
c  16       98    3.7       715  12  BJ429125   BJ429125 BJ429125
c  17       98    3.7       715  12  BJ432663   BJ432663 BJ432663
c  18       98    3.7       725  12  BJ435608   BJ435608 BJ435608
c  19       98    3.7       730  12  BJ430250   BJ430250 BJ430250
c  20       98    3.7       732  12  BJ429808   BJ429808 BJ429808
c  21       98    3.7       739  12  BJ372181   BJ372181 BJ372181
c  22       98    3.7       742  12  BJ433120   BJ433120 BJ433120
c  23       98    3.7       743  12  BJ434572   BJ434572 BJ434572
c  24       98    3.7       761  12  BJ434238   BJ434238 BJ434238
c  25       98    3.7       762  12  BJ372619   BJ372619 BJ372619
c  26       98    3.7       762  12  BJ429747   BJ429747 BJ429747
c  27       98    3.7       778  12  BJ399937   BJ399937 BJ399937
c  28       98    3.7       813  12  BJ372197   BJ372197 BJ372197
c  29     96.4    3.7       721  12  BJ428519   BJ428519 BJ428519
c  30     96.4    3.7       737  12  BJ398392   BJ398392 BJ398392
c  31     95.8    3.6       701  12  BJ398232   BJ398232 BJ398232
   32     95.6    3.6       641  13  BW267916   BW267916 BW267916
   33     94.2    3.6       918  28  AZ546359   AZ546359 ENTEL37TR
c  34       94    3.6       721  12  BJ377576   BJ377576 BJ377576
c  35     93.6    3.6       659  12  BJ373114   BJ373114 BJ373114
c  36     93.6    3.6       660  12  BJ428697   BJ428697 BJ428697
c  37     93.6    3.6       660  12  BJ430241   BJ430241 BJ430241
c  38     93.6    3.6       660  12  BJ433748   BJ433748 BJ433748
c  39     93.6    3.6       667  12  BJ399583   BJ399583 BJ399583
c  40     93.6    3.6       667  12  BJ428675   BJ428675 BJ428675
c  41     93.6    3.6       668  12  BJ399430   BJ399430 BJ399430
c  42     93.6    3.6       669  12  BJ431076   BJ431076 BJ431076
c  43     93.6    3.6       685  12  BJ398580   BJ398580 BJ398580
c  44     93.6    3.6       688  12  BJ397893   BJ397893 BJ397893
c  45     93.6    3.6       692  12  BJ377815   BJ377815 BJ377815




GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using frame_plus_p2n model

Run on:          January 10, 2004, 08:52:35 ; Search time 2373 Seconds
                 (without alignments)
                 3258.289 Million cell updates/sec

Title:           US-09-919-831-2
Perfect score:   947
Sequence:        1 MEDDLSAALVKALFDARTQR NPTTVATRYLVALDKRVPRA 189

Scoring table:   BLOSUM62
                 Xgapop 10.0 , Xgapext 0.5
                 Ygapop 10.0 , Ygapext 0.5
                 Fgapop  6.0 , Fgapext 7.0
                 Delop   6.0 , Delext  7.0

Searched:        2888711 seqs, 20454813386 residues

Total number of hits satisfying chosen parameters: 5777422

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Command line parameters:
-MODEL=frame+_p2n.model -DEV=xlp

-DB=GenEmbl -QFMT=fastap -SUFFIX=rge -MINMATCH=0.1 -LOOPCL=0 -LOOPEXT=0
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi -LIST=45
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000
-USER=US09919831_@CGN_1_1_3745_@runat_09012004_165716_644 -NCPU=6 -ICPU=3
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7
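
The parameter listing uses two token shapes: `-KEY=value` pairs and bare `-FLAG` switches. The Python sketch below shows one way to turn such a listing into a dictionary for comparing runs. It is a hypothetical helper written for this report's layout, not a GenCore tool, and it assumes tokens are separated by whitespace with no spaces inside values.

```python
def parse_params(text: str) -> dict:
    """Parse '-KEY=value' and bare '-FLAG' tokens; bare flags map to True."""
    params = {}
    for token in text.split():
        if not token.startswith("-"):
            continue  # skip anything that is not a parameter token
        key, sep, value = token[1:].partition("=")
        params[key] = value if sep else True
    return params


# A few parameters copied from the listing (values kept as strings).
sample = ("-MODEL=frame+_p2n.model -DEV=xlp -DB=GenEmbl -LIST=45 "
          "-END=-1 -NO_MMAP -XGAPOP=10 -XGAPEXT=0.5")
params = parse_params(sample)
print(params["DB"], params["XGAPEXT"], params["NO_MMAP"])
```

Values are left as strings on purpose, since entries such as `-END=-1` and `-TRANS=human40.cdi` mix numeric and file-name values.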



Database:        GenEmbl:*
                  1: gb_ba:*
                  2: gb_htg:*
                  3: gb_in:*
                  4: gb_om:*
                  5: gb_ov:*
                  6: gb_pat:*
                  7: gb_ph:*
                  8: gb_pl:*
                  9: gb_pr:*
                 10: gb_ro:*
                 11: gb_sts:*
                 12: gb_sy:*
                 13: gb_un:*
                 14: gb_vi:*
                 15: em_ba:*
                 16: em_fun:*
                 17: em_hum:*
                 18: em_in:*
                 19: em_mu:*
                 20: em_om:*
                 21: em_or:*
                 22: em_ov:*
                 23: em_pat:*
                 24: em_ph:*
                 25: em_pl:*
                 26: em_ro:*
                 27: em_sts:*
                 28: em_un:*
                 29: em_vi:*
                 30: em_htg_hum:*
                 31: em_htg_inv:*
                 32: em_htg_other:*
                 33: em_htg_mus:*
                 34: em_htg_pln:*
                 35: em_htg_rod:*
                 36: em_htg_mam:*
                 37: em_htg_vrt:*
                 38: em_sy:*
                 39: em_htgo_hum:*
                 40: em_htgo_mus:*
                 41: em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result          Query
  No.    Score  Match    Length  DB  ID         Description

    1      947  100.0       567   6  AX123146   AX123146 Sequence
    2      947  100.0       567   6  BD165263   BD165263 Novel pol
    3      947  100.0      2628   6  AX394263   AX394263 Sequence
c   4      947  100.0    320550   1  AP005282   AP005282 Corynebac
c   5      947  100.0    349980   6  AX127152   AX127152 Sequence
c   6      383   40.4    329709   1  AP002997   AP002997 Mesorhizo
    7    347.5   36.7     10148   1  AE008997   AE008997 Agrobacte
    8    347.5   36.7     11534   1  AE007964   AE007964 Agrobacte
c   9      334   35.3     10998   1  AE004614   AE004614 Pseudomon
   10      323   34.1    204050   1  AL646073   AL646073 Ralstonia
   11    307.5   32.5    292200   1  SC0939129  AL939129 Streptomy
   12      297   31.4    306650   1  SME591783  AL591783 Sinorhizo
   13    289.5   30.6    300900   1  AP005939   AP005939 Bradyrhiz
   14      267   28.2     10022   1  AE005873   AE005873 Caulobact
   15      266   28.1    222605   1  AP002555   AP002555 Escherich
   16    210.5   22.2    300956   1  AE016963   AE016963 Coxiella




GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using frame_plus_p2n model

Run on:          January 10, 2004, 08:49:14 ; Search time 258 Seconds
                 (without alignments)
                 1977.495 Million cell updates/sec

Title:           US-09-919-831-2
Perfect score:   947
Sequence:        1 MEDDLSAALVKALFDARTQR NPTTVATRYLVALDKRVPRA 189

Scoring table:   BLOSUM62
                 Xgapop 10.0 , Xgapext 0.5
                 Ygapop 10.0 , Ygapext 0.5
                 Fgapop  6.0 , Fgapext 7.0
                 Delop   6.0 , Delext  7.0

Searched:        2552756 seqs, 1349719017 residues

Total number of hits satisfying chosen parameters: 5105512

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Command line parameters:
-MODEL=frame+_p2n.model -DEV=xlp
-Q=/cgn2_1/USPTO_spool_p/US09919831/runat_09012004_165716_638/app_query.fasta_1.27
-DB=N_Geneseq_19Jun03 -QFMT=fastap -SUFFIX=rng -MINMATCH=0.1 -LOOPCL=0
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000
-USER=US09919831_@CGN_1_1_396_@runat_09012004_165716_638 -NCPU=6 -ICPU=3
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7



Database:        N_Geneseq_19Jun03:*
                  1: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1980.DAT:*
                  2: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1981.DAT:*
                  3: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1982.DAT:*
                  4: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1983.DAT:*
                  5: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1984.DAT:*
                  6: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1985.DAT:*
                  7: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1986.DAT:*
                  8: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1987.DAT:*
                  9: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1988.DAT:*
                 10: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1989.DAT:*
                 11: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1990.DAT:*
                 12: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1991.DAT:*
                 13: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1992.DAT:*
                 14: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1993.DAT:*
                 15: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1994.DAT:*
                 16: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1995.DAT:*
                 17: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1996.DAT:*
                 18: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1997.DAT:*
                 19: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1998.DAT:*
                 20: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA1999.DAT:*
                 21: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2000.DAT:*
                 22: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001A.DAT:*
                 23: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2001B.DAT:*
                 24: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2002.DAT:*
                 25: /SIDS1/gcgdata/geneseq/geneseqn-embl/NA2003.DAT:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

                  %
Result          Query
  No.    Score  Match    Length  DB  ID         Description

    1      947  100.0       567  22  AAH68027   C glutamicum codin
    2      947  100.0       567  25  ACA00349   C. glutamicum deri
    3      947  100.0      2628  24  AAD34148   Corynebacterium gl
c   4      947  100.0    349980  22  AAH68533   C glutamicum codin
    5      163   17.2     17083  25  ABZ58812   S. cinnamoneus cin
    6      147   15.5       663  24  ABK78585   Bacillus clausii g
    7      146   15.4      4975  25  ABZ58814   DNA sequence of pC
c   8      129   13.6     37286  23  AAS59522   Propionibacterium
    9      127   13.4      7953  18  AAV74463   Staphylococcus aur
   10      119   12.6       240  21  AAZ91472   Sphaerotilus natan
   11      113   11.9       662   9  AAN81167   Operator from Baci
   12    112.5   11.9       552  24  ABK74142   Bacillus lichenifo
   13      112   11.8      1378  23  AAS89064   DNA encoding novel
   14      110   11.6       362  24  ABN20810   Human ORFX polynuc
c  15      108   11.4      1191  23  AAS89918   DNA encoding novel
c  16      108   11.4      1191  23  AAS93813   DNA encoding novel
c  17      106   11.2    349980  22  AAH68530   C glutamicum codin
   18    105.5   11.1      9515  15  AAQ55145   Pseudomonas aerugi
   19    105.5   11.1      9515  25  ABZ77356   Nucleotide sequenc
   20    105.5   11.1      9515  25  AAL51844   Pseudomonas aerugi
   21      103   10.9       291  25  ABX06855   S. pneumoniae type
c  22      103   10.9      5857  19  AAV52161   Streptococcus pneu
c  23      103   10.9   1830121  17  AAT42063   Haemophilus influe
   24      103   10.9   2162598  25  ABS56454   Streptococcus pneu
   25      101   10.7       240  21  AAA13364   SgrAI control prot
   26      100   10.6       540  22  AAH52849   S. epidermidis ope
   27      100   10.6       552  24  ABN91111   Staphylococcus epi
c  28      100   10.6      3364  22  AAH54269   S. epidermidis gen
   29     99.5   10.5      1799  23  AAS93668   DNA encoding novel
   30     99.5   10.5     21185  21  AAA63350   Streptomyces globi
   31     99.5   10.5     63164  21  AAA63348   Streptomyces globi
c  32     98.5   10.4     16870  20  AAX13035   Enterococcus faeca
c  33     98.5   10.4     16870  24  ABS98830   Enterococcus faeca




GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using frame_plus_p2n model

Run on:         January 10, 2004, 11:21:50 ; Search time 79 Seconds
                                             (without alignments)
                                             1055.967 Million cell updates/sec

Title:          US-09-919-831-2
Perfect score:  947
Sequence:       1 MEDDLSAALVKALFDARTQR NPTTVATRYLVALDKRVPRA 189

Scoring table:  BLOSUM62
                Xgapop 10.0 , Xgapext 0.5
                Ygapop 10.0 , Ygapext 0.5
                Fgapop  6.0 , Fgapext 7.0
                Delop   6.0 , Delext  7.0

Searched:       569978 seqs, 220691566 residues

Total number of hits satisfying chosen parameters:  1139956

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Command line parameters:
-MODEL=frame+_p2n.model -DEV=xlp
-Q=/cgn2_1/USPTO_spool_p/US09919831/runat_09012004_165717_656/app_query.fasta_1.327
-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX=rni -MINMATCH=0.1 -LOOPCL=0
-LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000
-USER=US09919831_@CGN_1_1_85_@runat_09012004_165717_656 -NCPU=6 -ICPU=3
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7

Database :      Issued_Patents_NA:*
                1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
                2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
                3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
                4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
                5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
                6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
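
Note on the "% Query Match" column: in the summaries of this report the value agrees with the score expressed as a percentage of the "Perfect score" given in the run header (947 for this query), rounded to one decimal place. That relationship is inferred from the printed figures, not taken from GenCore documentation, so the sketch below is only a consistency check. The function name is hypothetical.

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect score, to one decimal place.

    Assumption: this mirrors how the report's "% Query Match" column relates
    to its "Score" column; it is inferred from the printed values only.
    """
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 947  # "Perfect score" from the run header

# Scores taken from the first rows of the summary table that follows.
for score in (334, 326, 265.5, 195):
    print(f"{score:>6}  {query_match_percent(score, PERFECT_SCORE):5.1f}")
```

A row whose printed percentage disagrees with this calculation is a likely sign of a transcription or OCR error in either the score or the percentage.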



SUMMARIES

                 %
Result         Query
  No.   Score  Match   Length  DB  ID                    Description

c   1     334   35.3      732   4  US-09-252-991A-13576  Sequence 13576, A
    2     334   35.3     1098   4  US-09-252-991A-13766  Sequence 13766, A
    3     334   35.3     1731   4  US-09-252-991A-13674  Sequence 13674, A
c   4     326   34.4     1347   4  US-09-252-991A-13659  Sequence 13659, A
    5   265.5   28.0      603   4  US-09-328-352-2710    Sequence 2710, Ap
c   6     195   20.6      891   4  US-09-252-991A-13471  Sequence 13471, A
    7     171   18.1    46819   4  US-09-453-702B-72     Sequence 72, Appl
    8     157   16.6      636   4  US-09-252-991A-8938   Sequence 8938, Ap
    9     155   16.4      531   4  US-09-252-991A-8719   Sequence 8719, Ap
   10   129.5   13.7      561   4  US-09-252-991A-5411   Sequence 5411, Ap
   11   129.5   13.7      741   4  US-09-252-991A-5419   Sequence 5419, Ap
   12   129.5   13.7      879   4  US-09-252-991A-5407   Sequence 5407, Ap
c  13   129.5   13.7     1593   4  US-09-252-991A-5392   Sequence 5392, Ap
   14   121.5   12.8      783   4  US-09-252-991A-16173  Sequence 16173, A
   15     119   12.6      240   3  US-09-143-776-5       Sequence 5, Appli
   16   118.5   12.5      627   4  US-09-252-991A-7163   Sequence 7163, Ap
   17   105.5   11.1     9515   1  US-08-920-812-13      Sequence 13, Appl
   18   105.5   11.1     9515   1  US-08-920-827-13      Sequence 13, Appl
   19   105.5   11.1     9515   1  US-08-921-177-13      Sequence 13, Appl
   20   105.5   11.1     9515   1  US-08-362-577C-13     Sequence 13, Appl
   21   105.5   11.1     9515   2  US-08-920-828-13      Sequence 13, Appl
   22   103.5   10.9      582   4  US-09-252-991A-15732  Sequence 15732, A
c  23     103   10.9     5857   4  US-08-961-527-28      Sequence 28, Appl
c  24     103   10.9  1830121   4  US-09-557-884-1       Sequence 1, Appli
c  25     103   10.9  1830121   4  US-09-643-990A-1      Sequence 1, Appli
   26     101   10.7      240   3  US-09-146-675-3       Sequence 3, Appli
   27   100.5   10.6      639   4  US-09-328-352-2047    Sequence 2047, Ap
   28     100   10.6      552   4  US-09-134-001C-574    Sequence 574, App
   29     100   10.6     1062   4  US-09-252-991A-3271   Sequence 3271, Ap
   30      99   10.5      570   4  US-09-252-991A-12340  Sequence 12340, A
c  31      98   10.3    28882   4  US-08-961-527-140     Sequence 140, App
c  32    94.5   10.0     1347   4  US-09-252-991A-5399   Sequence 5399, Ap
   33      94    9.9      855   4  US-09-252-991A-867    Sequence 867, App
c  34      94    9.9     1085   4  US-08-624-447-3       Sequence 3, Appli
c  35      91    9.6    46899   1  US-08-471-119A-1      Sequence 1, Appli
c  36    89.5    9.5  4403765   3  US-09-103-840A-2      Sequence 2, Appli
c  37    89.5    9.5  4411529   3  US-09-103-840A-1      Sequence 1, Appli
   38    88.5    9.3     1851   3  US-08-303-861-11      Sequence 11, Appl
   39    88.5    9.3    23673   4  US-09-773-816-1       Sequence 1, Appli
   40    87.5    9.2      552   4  US-09-252-991A-9619   Sequence 9619, Ap
c  41    87.5    9.2     3006   4  US-09-252-991A-9720   Sequence 9720, Ap
   42    86.5    9.1  4403765   3  US-09-103-840A-2      Sequence 2, Appli
   43    86.5    9.1  4411529   3  US-09-103-840A-1      Sequence 1, Appli
   44      86    9.1      360   4  US-09-252-991A-3519   Sequence 3519, Ap
   45      86    9.1      360   4  US-09-252-991A-8563   Sequence 8563, Ap

ALIGNMENTS 

RESULT 1 

US-09-252-991A-13576/C

; Sequence 13576, Application US/09252991A 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using frame_plus_p2n model

Run on:         January 10, 2004, 12:12:10 ; Search time 280 Seconds
                                             (without alignments)
                                             2344.014 Million cell updates/sec

Title:          US-09-919-831-2
Perfect score:  947
Sequence:       1 MEDDLSAALVKALFDARTQR NPTTVATRYLVALDKRVPRA 189

Scoring table:  BLOSUM62
                Xgapop 10.0 , Xgapext 0.5
                Ygapop 10.0 , Ygapext 0.5
                Fgapop  6.0 , Fgapext 7.0
                Delop   6.0 , Delext  7.0

Searched:       2276164 seqs, 1736306516 residues

Total number of hits satisfying chosen parameters:  4552328

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Command line parameters:
-MODEL=frame+_p2n.model -DEV=xlp
-Q=/cgn2_1/USPTO_spool_p/US09919831/runat_09012004_165718_694/app_query.fasta_1.327
-DB=Published_Applications_NA -QFMT=fastap -SUFFIX=rnpb -MINMATCH=0.1
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=blosum62
-TRANS=human40.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100
-THR_MIN=0 -ALIGN=15 -MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0
-MAXLEN=2000000000 -USER=US09919831_@CGN_1_1_85_@runat_09012004_165718_694
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5
-FGAPOP=6 -FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7

Database :      Published_Applications_NA:*
                1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
                10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
                11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
                12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
                13: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq2:*
                14: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
                15: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
                16: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
                17: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
                18: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
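
Note on "Million cell updates/sec": the figure in the run header above is consistent with (2 x query length x database residues) / search time. The factor of 2 is an assumption, inferred from the hit count being exactly twice the number of sequences searched (both strands scored); it is not taken from GenCore documentation. The function name below is hypothetical, and the sketch is only a consistency check on the printed header values.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float, strands: int = 2) -> float:
    """Estimate dynamic-programming cell updates per second, in millions.

    Assumption: every query residue is compared against every database
    residue once per strand, and both strands are searched (strands=2).
    """
    cells = strands * query_len * db_residues
    return cells / seconds / 1e6


# Values from the Published_Applications_NA run header:
# 189-residue query, 1736306516 residues searched, 280 seconds.
rate = million_cell_updates_per_sec(189, 1736306516, 280)
print(f"{rate:.3f} Million cell updates/sec")
```

The same calculation applied to the Issued_Patents_NA run (220691566 residues, 79 seconds) gives a value that matches its printed 1055.967.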

SUMMARIES

                 %
Result         Query
  No.   Score  Match   Length  DB  ID                    Description

    1     947  100.0      567  10  US-09-738-626-3062    Sequence 3062, Ap
    2     947  100.0     2628  10  US-09-919-831-1       Sequence 1, Appli
    3     947  100.0  3309400  10  US-09-738-626-1       Sequence 1, Appl
    4     171   18.1    46819  15  US-10-114-170-72      Sequence 72, Appl
    5     162   17.1      603  15  US-10-156-761-7019    Sequence 7019, Ap
    6     162   17.1  9025608  15  US-10-156-761-1       Sequence 1, Appl
    7     147   15.5      663  10  US-09-974-300-5876    Sequence 5876, Ap
    8     142   15.0      624  15  US-10-156-761-1633    Sequence 1633, Ap
    9     142   15.0  9025608  15  US-10-156-761-1       Sequence 1, Appl
   10     139   14.7      573  15  US-10-156-761-6696    Sequence 6696, Ap
   11     127   13.4     7953   8  US-08-781-986A-152    Sequence 152, App
   12   112.5   11.9      552  10  US-09-974-300-1433    Sequence 1433, Ap
   13     103   10.9  1830121  15  US-10-329-960-1       Sequence 1, Appl
   14    99.5   10.5    63158  13  US-10-292-198-1       Sequence 1, Appli
   15    98.5   10.4    16870  10  US-09-070-927A-98     Sequence 98, Appl
   16      97   10.2      402  10  US-09-974-300-3201    Sequence 3201, Ap
   17      97   10.2      504  10  US-09-974-300-5801    Sequence 5801, Ap
   18      96   10.1    12445  10  US-09-070-927A-242    Sequence 242, App
   19      93    9.8     1774  12  US-10-369-493-35333   Sequence 35333, A
   20      92    9.7      312  10  US-09-738-626-3219    Sequence 3219, Ap
   21      92    9.7      671  13  US-10-027-632-126891  Sequence 126891,
   22      92    9.7      671  14  US-10-027-632-126891  Sequence 126891,
   23      92    9.7      723  15  US-10-156-761-2942    Sequence 2942, Ap
   24      92    9.7  3309400  10  US-09-738-626-1       Sequence 1, Appl
   25    91.5    9.7     1299  15  US-10-156-761-1179    Sequence 1179, Ap
   26      91    9.6    25085  13  US-10-132-134-35      Sequence 35, Appl
   27      91    9.6    50543  13  US-10-132-134-25      Sequence 25, Appl
   28      90    9.5      474  15  US-10-156-761-3996    Sequence 3996, Ap
   29      90    9.5     1977  12  US-10-369-493-41620   Sequence 41620, A
   30    89.5    9.5     1041  15  US-10-156-761-1203    Sequence 1203, Ap
   31      89    9.4      671  13  US-10-027-632-126892  Sequence 126892,
   32      89    9.4      671  14  US-10-027-632-126892  Sequence 126892,
   33      89    9.4     3234  13  US-10-214-446-23      Sequence 23, Appl
   34    88.5    9.3    14427  15  US-10-156-761-1540    Sequence 1540, Ap
   35      88    9.3      321  10  US-09-738-626-2152    Sequence 2152, Ap
   36      88    9.3     2079  15  US-10-156-761-6483    Sequence 6483, Ap
   37      88    9.3     2808  15  US-10-156-761-2021    Sequence 2021, Ap
   38    87.5    9.2      405  10  US-09-974-300-3648    Sequence 3648, Ap
   39    87.5    9.2      618  15  US-10-156-761-7117    Sequence 7117, Ap
   40      87    9.2      519  15  US-10-153-668-457     Sequence 457, App

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using frame_plus_p2n model

Run on:         January 10, 2004, 10:53:25 ; Search time 2062 Seconds
                                             (without alignments)
                                             2227.714 Million cell updates/sec

Title:          US-09-919-831-2
Perfect score:  947
Sequence:       1 MEDDLSAALVKALFDARTQR NPTTVATRYLVALDKRVPRA 189

Scoring table:  BLOSUM62
                Xgapop 10.0 , Xgapext 0.5
                Ygapop 10.0 , Ygapext 0.5
                Fgapop  6.0 , Fgapext 7.0
                Delop   6.0 , Delext  7.0

Searched:       22781392 seqs, 12152238056 residues

Total number of hits satisfying chosen parameters:  45562784

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Command line parameters:
-MODEL=frame+_p2n.model -DEV=xlp
-Q=/cgn2_1/USPTO_spool_p/US09919831/runat_09012004_165717_650/app_query.fasta_1.327
-DB=EST -QFMT=fastap -SUFFIX=rst -MINMATCH=0.1 -LOOPCL=0 -LOOPEXT=0
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40.cdi -LIST=45
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000
-USER=US09919831_@CGN_1_1_2874_@runat_09012004_165717_650 -NCPU=6 -ICPU=3
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG
-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 -FGAPOP=6
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7

Database :      EST:*
                1: em_estba:*
                2: em_esthum:*
                3: em_estin:*
                4: em_estmu:*
                5: em_estov:*
                6: em_estpl:*
                7: em_estro:*
                8: em_htc:*
                9: gb_est1:*
                10: gb_est2:*
                11: gb_htc:*
                12: gb_est3:*
                13: gb_est4:*
                14: gb_est5:*
                15: em_estfun:*
                16: em_estom:*
                17: em_gss_hum:*
                18: em_gss_inv:*
                19: em_gss_pln:*
                20: em_gss_vrt:*
                21: em_gss_fun:*
                22: em_gss_mam:*
                23: em_gss_mus:*
                24: em_gss_pro:*
                25: em_gss_rod:*
                26: em_gss_phg:*
                27: em_gss_vrl:*
                28: gb_gss1:*
                29: gb_gss2:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                 %
Result         Query
  No.   Score  Match   Length  DB  ID        Description

    1     281   29.7     1390  29  BZ551614  BZ551614 pacs1-60
c   2     196   20.7      833  29  BZ570253  BZ570253 msh2 127.
    3     124   13.1      799  29  BZ559712  BZ559712 pacs2-164
c   4     111   11.7
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